Search CORE

4 research outputs found

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages

Author: Axelrod Vera
Bapna Ankur
Beaufays Françoise
Chen Nanxin
Chen Zhehuai
Chiu Chung-Cheng
Haghani Parisa
Han Wei
Hu Ke
Li Bo
Meng Zhong
Moreno Pedro
Park Daniel S.
Perng Ginger
Prabhavalkar Rohit
Qin James
Ramabhadran Bhuvana
Riesa Jason
Rosenberg Andrew
Sainath Tara
Schalkwyk Johan
Soltau Hagen
Strohman Trevor
Wang Gary
Wang Yongqiang
Wu Yonghui
Zhang Yu
Publication venue
Publication date: 24/09/2023
Field of study

We introduce the Universal Speech Model (USM), a single large model that performs automatic speech recognition (ASR) across 100+ languages. This is achieved by pre-training the encoder of the model on a large unlabeled multilingual dataset of 12 million (M) hours spanning over 300 languages, and fine-tuning on a smaller labeled dataset. We use multilingual pre-training with random-projection quantization and speech-text modality matching to achieve state-of-the-art performance on downstream multilingual ASR and speech-to-text translation tasks. We also demonstrate that despite using a labeled training set 1/7-th the size of that used for the Whisper model, our model exhibits comparable or better performance on both in-domain and out-of-domain speech recognition tasks across many languages.Comment: 20 pages, 7 figures, 8 table

arXiv.org e-Print Archive

Providing content-based services in a peer-to-peer environment

Author: Chenxi Wang
Ginger Perng
Michael K. Reiter
Publication venue
Publication date: 01/01/2004
Field of study

Information dissemination in wide area networks has recently garnered much attention. Two differing models, publish/subscribe and rendezvous-based multicast atop overlay networks, have emerged as the two leading approaches for this goal. Event-based publish/subscribe supports contentbased services with powerful filtering capabilities, while peer-to-peer rendezvous-based services allow for efficient communication in a dynamic network infrastructure. We describe Reach, a system that integrates these two approaches to provide efficient and scalable content-based services in a dynamic network setting.

CiteSeerX

SWAP: Shared Wireless Access Protocol (using Reciprocity)

Author: David G. Andersen
Ginger Perng
Matthew W. Dunlop
Publication venue
Publication date: 01/06/2007
Field of study

Abstract — Wireless access points are becoming more and more prominent in the home, yet there is no incentive to encourage access point owners to share their service. We introduce SWAP, a lightweight protocol that uses reciprocity to motivate users to share service. Each node participating in SWAP stores perishable receipts that are used to calculate a user’s rating (how much the user shares his or her access point). SWAP does not use a centralized authority to store or validate receipts nor does it place an excessive burden on peers. SWAP is also robust against collusion, which we show through analysis and implementation. As demonstrated by an implementation of the most computationally expensive portions of the protocol, SWAP imposes little overhead even on mobile devices. Keywords—Community Wireless, Reciprocity, Wireless Security I

CiteSeerX

Crossref

Censorship Resistance Revisited ⋆

Author: Chenxi Wang
Ginger Perng
Michael K. Reiter
Publication venue
Publication date
Field of study

Abstract. “Censorship resistant ” systems attempt to prevent censors from imposing a particular distribution of content across a system. In this paper, we introduce a variation of censorship resistance (CR) that is resistant to selective filtering even by a censor who is able to inspect (but not alter) the internal contents and computations of each data server, excluding only the server’s private signature key. This models a service provided by operators who do not hide their identities from censors. Even with such a strong adversarial model, our definition states that CR is only achieved if the censor must disable the entire system to filter selected content. We show that existing censorship resistant systems fail to meet this definition; that Private Information Retrieval (PIR) is necessary, though not sufficient, to achieve our definition of CR; and that CR is achieved through a modification of PIR for which known implementations exist.

CiteSeerX